Approximating Large Frequency Moments with O(n1-2/k) Bits
نویسندگان
چکیده
In this paper we consider the problem of approximating frequency moments in the streaming model. Given a stream D = {p1, p2, . . . , pm} of numbers from {1, . . . , n}, a frequency of i is defined as fi = |{j : pj = i}|. The k-th frequency moment of D is defined as Fk = ∑n i=1 f k i . In this paper we give an upper bound on the space required to find a k-th frequency moment of O(n1−2/k) bits that matches, up to a constant factor, the lower bound of [46] for constant and constant k. Our algorithm makes a single pass over the stream and works for any constant k > 3.
منابع مشابه
An Optimal Algorithm for Large Frequency Moments Using O(n^(1-2/k)) Bits
In this paper, we provide the first optimal algorithm for the remaining open question from the seminal paper of Alon, Matias, and Szegedy: approximating large frequency moments. Given a stream D = {p1, p2, . . . , pm} of numbers from {1, . . . , n}, a frequency of i is defined as fi = |{j : pj = i}|. The k-th frequency moment of D is defined as Fk = ∑n i=1 f k i . We give an upper bound on the ...
متن کاملThe Simultaneous Communication of Disjointness with Applications to Data Streams
We study k-party set disjointness in the simultaneous message-passing model, and show that even if each element i ∈ [n] is guaranteed to either belong to all k parties or to at most O(1) parties in expectation (and to at most O(logn) parties with high probability), then Ω(nmin(log 1/δ, log k)/k) communication is required by any δ-error communication protocol for this problem (assuming k = Ω(log...
متن کاملA Tight Lower Bound for High Frequency Moment Estimation with Small Error
We show an Ω((n1−2/p logM)/ ) bits of space lower bound for (1 + )-approximating the p-th frequency moment Fp = ‖x‖p = ∑n i=1 |xi| of a vector x ∈ {−M,−M+1, . . . ,M} with constant probability in the turnstile model for data streams, for any p > 2 and ≥ 1/n (we require ≥ 1/n since there is a trivial O(n logM) upper bound). This lower bound matches the space complexity of an upper bound of Gangu...
متن کاملApproximating Large Frequency Moments with Pick-and-Drop Sampling
Given data stream D = {p1, p2, . . . , pm} of size m of numbers from {1, . . . , n}, the frequency of i is defined as fi = |{j : pj = i}|. The kth frequency moment of D is defined as Fk = ∑n i=1 f k i . We consider the problem of approximating frequency moments in insertion-only streams for k ≥ 3. For any constant c we show an O(n log(n) log(n)) upper bound on the space complexity of the proble...
متن کاملA The Value of Multiple Read/Write Streams for Approximating Frequency Moments
We consider the read/write streams model, an extension of the standard data stream model in which an algorithm can create and manipulate multiple read/write streams in addition to its input data stream. Like the data stream model, the most important parameter for this model is the amount of internal memory used by such an algorithm. The other key parameters are the number of streams the algorit...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1401.1763 شماره
صفحات -
تاریخ انتشار 2014